"Power tags" in information retrieval

نویسندگان

  • Isabella Peters
  • Wolfgang G. Stock
چکیده

Purpose – Many Web 2.0 services (including Library 2.0 catalogs) make use of folksonomies. The purpose of this paper is to cut off all tags in the long tail of a document-specific tag distribution. The remaining tags at the beginning of a tag distribution are considered power tags and form a new, additional search option in information retrieval systems. Design/methodology/approach – In a theoretical approach the paper discusses document-specific tag distributions (power law and inverse-logistic shape), the development of such distributions (Yule-Simon process and shuffling theory) and introduces search tags (besides the well-known index tags) as a possibility for generating tag distributions. Findings – Search tags are compatible with broad and narrow folksonomies and with all knowledge organization systems (e.g. classification systems and thesauri), while index tags are only applicable in broad folksonomies. Based on these findings, the paper presents a sketch of an algorithm for mining and processing power tags in information retrieval systems. Research limitations/implications – This conceptual approach is in need of empirical evaluation in a concrete retrieval system. Practical implications – Power tags are a new search option for retrieval systems to limit the amount of hits. Originality/value – The paper introduces power tags as a means for enhancing the precision of search results in information retrieval systems that apply folksonomies, e.g. catalogs in Library 2.0 environments.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی وضعیت ابربرچسب­ها در ساختار وب­سایت­های کتابخانه­های مرکزی دانشگاه­های علوم پزشکی ایران

    Introduction : One of the recommended ways in organizing the information in the websites is the application of Meta Tags. The application of a variety of Meta Tags can affect the precision rate of search engines retrieval. They can also promote the rank of a website. The purpose of the study was to investigate the structure of libraries websites based on Meta Tags in medical science univers...

متن کامل

Information-Theoretic Models of Tagging

In earlier work, we showed using Kulback-Leibler (KL) divergence that tags form a power law distribution very quickly. Yet there is one major observed deviation from the ideal power law distribution for the top 25 tags, a large “bump” in increased frequency for the top 7-10 tags. We originally hypothesized that the “bump” in the data could be caused by a preferential attachment mechanism. Howev...

متن کامل

Retrieval Effectiveness of Tagging Systems

Social tagging is a widespread activity for indexing usergenerated content on Web services. This paper summarizes research on folksonomies and their retrieval effectiveness. A TREC-like retrieval test was conducted with tags and resources from the social bookmarking system delicious, which resulted in recall and precision values for tag-only searches. Moreover, several experimental tag-based da...

متن کامل

Collaborative Annotation for Pseudo Relevance Feedback

We present a pseudo relevance feedback technique for information retrieval, which expands keyword queries with semantic annotation found in the freely available Del.icio.us collaborative tagging system. We hypothesise that collaborative tags represent semantic information that may render queries more informative, and hence enhance retrieval performance. Experiments with three different techniqu...

متن کامل

Are Tags Better Than Audio? The Effect of Joint Use of Tags and Audio Content Features for Artistic Style Clustering

Social tags are receiving growing interests in information retrieval. In music information retrieval previous research has demonstrated that tags can assist in music classification and clustering. This paper studies the problem of combining tags and audio contents for artistic style clustering. After studying the effectiveness of using tags and audio contents separately for clustering, this pap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Library Hi Tech

دوره 28  شماره 

صفحات  -

تاریخ انتشار 2010